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INDEHISCENT1 (IND1) genomic 



(3361) 
INDl 



3 <110> 
4 
5 

7 <120> 
8 

10 <130> 

12 <140> 

13 <141> 

15 <150> 

16 <151> 

18 <150> 

19 <151> PRIOR FILING DATE: 1999-06-25 
21 <160> NUMBER OF SEQ ID NOS : 9 
23 <170> SOFTWARE: Patentln Ver. 2.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 3856 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Arabidopsis thaliana * 

30 <220> FEATURE \S 

31 <223> OTHER INFORMATION 

33 <220> FEATURE: 

34 <221> NAME/KEY: CDS 

35 <222> LOCATION: (2765). 

36 <223> OTHER INFORMATION 

38 <400> SEQUENCE: 1 

39 ctctagacca tctactatcc ggttgttgac ccttaaagct tttgaagact actagaataa 60 
41 tgcaaatacc atatgtccat atccatcctt ttcttttgtt tgaactgaac attctaattt 120 
4 3 tgtaaaagaa aaaaccttat gttaatatca ccgtaggcaa aaaaaatatc tcatcatatt 180 
4 5 aaatttttat tataagatta tacattctct cgttgtaaga gttactccaa ttgcaagtgt 240 
47 tgtattaact aataaaaagg acgaaagtag gaagcttata attaattgat gttgcatagt 300 
49 actggtatat tgttgatgaa tataacaagt atgaacatta atgcatgaaa cggggtattt 360 
51 tgtcttgaac tcattaaagg caatgtgaaa agaagatgtg aggtctcatt ttgaaaattt 420 
53 atcttctagc tttgtcgatt ttaaatctat gaaatgaacg caacatatag aaatttcatg 480 
55 tggacaacga catttagacg gtatcttaat tagaccgatt aattagtaat atacttatat 540 
57 atataattag tggtgattat aagtttactt atccacttga gaatttaaac aatgggcaat 600 
59 accttaatgt cgaaagaagc cgtccccact tcgtgtaatg agttatgggg gagagatcct 660 
61 gttaaatcgt caaataaaac aacttaagaa ctagaaattg acaccaaaaa tcataaagag 720 
63 aacgttgaag aagtcattta tcgtatccag ctcatatttc ctagctaaga tcaaatcaag 780 
65 gccgttgaaa gggcttgtaa gaaaatgtcg aagaaaccgt ggggtttaga agaaagacaa 840 
67 gaaatagaag aacaatgatg ttaaattgcc tattttggtg tataggagtt gtcaaaagag 900 
69 gagagagaga agaaaattag gtcaaaataa tgagcactaa aaatggagac atgtgttgag 960 
71 taactattac aagagcgact tatgcttcct tatggcaatg atatccaaac caaagtgcaa 1020 
73 cgctcctttt ttgccctaat ttcgtaaagt ctctctcctt cttcgtcctt aggaaaaacc 1080 
75 ctagaaattt aatcccttgt tcttgatctt gctttttgag taaccatgat tttgaccaca 1140 
77 cactatttct tctatctttt gtggtctata ggattttgct ttatatgtgt ttcttgtatt 1200 
79 gctccgtacg tacgtatacg aatttaaatg gttataacaa ggtttatata aactagcaca 1260 
81 aatgagtcca tgaaatttgt tagcgaaaaa ggtagaaata tattgagtct ttaaacggca 1320 



RECEIVED 

SEP 0 4 2001 

TECH CENTER 160012900 



APPLICANT: Liljegren, Sarah 
Yanofsky, Martin F. 

The Regents of the University of California 

TITLE OF INVENTION: Control of Fruit Dehiscence in Arabidopsis by 
INDEHISCENT1 Genes 
FILE REFERENCE: 19 4 52 A- 000700US 
CURRENT APPLICATION NUMBER: US 09/548,971 
CURRENT FILING DATE: 2000-04-13 

PRIOR APPLICATION NUMBER: US 60/090,649 m 4^ i 

PRIOR FILING DATE : 1998-06-25 D08S Not Comply 

prior application number: us 09/339,998 Corrected Diskette Needed 



file://C:\Crf3\Outhold\VsrI548971 .htm 



8/7/01 



RAW SEQUENCE LISTING DATE: 08/07/2001 

PATENT APPLICATION: US/09/548,971 TIME: 08:16:07 



Input Set : A:\19452a-7.app 

Output Set: N:\CRF3\08072001\l548971.raw 

83 atatatataa ttttgctgca aaacttagct ttaatcatga tctaatgata ttttctttaa 1380 
85 tttcctttgc caaattaatc acatgcacgg atttttggca agttatgtgt cgaattcttc 1440 
87 cattcacaca acactaaact taattagaac tctaggaaat attttaaaat gacaacttta 1500 
89 tcgaaaaaaa tttagttatg aaaacaattc cagaattaaa catgagctat ataatttaag 1560 
91 ataaaatgaa gtaatattga tatgtatgta ataacatatc tgattgcggt aaaaaaaaac 1620 
93 atatctgatt aaattgttca tgcaggccca tgtcactatg atgtcatcac gtttttattt 1680 
95 tcacaataac taatatatat tcaaaaaaat agttttgtca gattaaattt tttttggtgg 1740 
97 tcagctttct ccaacctact aaactagttt ggaatgttct cttctttatt tttctttttc 1800 
99 ttgatttctt atgtttttta tttatggaat tttaagacgg attgtttagg tcgtttctct 1860 
101 cttttcttgt tttctaaagt tacttttgta aactcatctc ctcccaatta gacagtcaat 1920 
103 catatagtta tcttttaata tatgtctagt tgataaaaaa aatgaaaaaa tactggtggt 1980 
105 agttctacta atgtttgtgt aaaaaatctg atattatgaa tctaatcaat ttctttgatc 2040 
107 gtataatgtg ggttaaattt agtaattttt tacataaata agaactgtaa tgttgatgta 2100 
109 tattggggaa tcagtatatt agcttgggta actatacttc tggaaatact tgaagattta 2160 
111 actatttgca aaattataat ttagtcccga aaaatacaga cgacgggaca cgacaacata 2220 
113 taagcaggtt tgaatcttgg aaaattttgt atacataacc tatataaata ctaatgttct 2280 
115 ggttgggttc aaaagccttt tcaaaagttc cattttttaa attcaaggac attttacata 2340 
117 ggaaataagt tgagtcataa aaaataatgg ttattttgta aggttttttt tttgattaaa 2400 
119 acgcacatat taagaagtta gttttttttc actaccaaat atcaattaat ttaaaaccat 2460 
121 gcaaccattc ataaaacaat actattaaag aatataaata atcacaaaat attaaataca 2520 
123 cttaaaattt acatataaat ttacaaaaca tctaattaat tgaaacagaa aggaaaaggt 2580 
125 aaaatatatc ataaaatgag acatatatcc tataaaaaaa aaatgaggca tatgaagtaa 2640 
127 ataataagag acatgcatgt aagcattcgg ttaattaatc gagtcaaaga tatatatcag 2700 
129 taaatacata tgtgtatatt tctggaaaaa gaatatatat attgagaaat aagaaaagat 2760 

131 gaaa atg gaa aat ggt atg tat aaa aag aaa gga gtg tgc gac tct tgt 2809 

132 Met Glu Asn Gly Met Tyr Lys Lys Lys Gly Val Cys Asp Ser Cys 

133 15 10 15 

135 gtc teg tec aaa age aga tec aac cac age ccc aaa aga age atg atg 2857 

136 Val Ser Ser Lys Ser Arg Ser Asn His Ser Pro Lys Arg Ser Met Met 

137 20 25 30 

139 gag cct cag cct cac cat etc etc atg gat tgg aac aaa get aat gat 2905 

140 Glu Pro Gin Pro His His Leu Leu Met Asp Trp Asn Lys Ala Asn Asp 

141 35 40 45 

143 ctt etc aca caa gaa cac gca get ttt etc aat gat cct cac cat etc 2953 

144 Leu Leu Thr Gin Glu His Ala Ala Phe Leu Asn Asp Pro His His Leu 

145 50 55 60 

147 atg tta gat cca cct ccc gaa acc eta att cac ttg gac gaa gac gaa 3001 

148 Met Leu Asp Pro Pro Pro Glu Thr Leu lie His Leu Asp Glu Asp Glu 

149 65 70 75 

151 gag tac gat gaa gac atg gat gcg atg aag gag atg cag tac atg ate 3049 

152 Glu Tyr Asp Glu Asp Met Asp Ala Met Lys Glu Met Gin Tyr Met lie 

153 80 85 90 95 

155 gee gtc atg cag ccc gta gac ate gac cct gee acg gtc cct aag ccg 3097 

156 Ala Val Met Gin Pro Val Asp lie Asp Pro Ala Thr Val Pro Lys Pro 

157 100 105 110 

159 aac cgc cgt aac gta agg ata age gac gat cct cag acg gtg gtt get 3145 

160 Asn Arg Arg Asn Val Arg lie Ser Asp Asp Pro Gin Thr Val Val Ala 

161 115 120 125 

163 cgt egg cgt egg gaa agg ate age gag aag ate cga att etc aag agg 3193 
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164 Arg Arg Arg Arg Glu Arg lie Ser Glu Lys lie Arg lie Leu Lys Arg 

165 130 135 140 

167 ate gtg cct ggt ggt gcg aag atg gac aca get tec atg etc gac gaa 3241 

168 lie Val Pro Gly Gly Ala Lys Met Asp Thr Ala Ser Met Leu Asp Glu 

169 145 150 155 

171 gec ata cgt tac ace aag ttc ttg aaa egg cag gtg agg att ctt cag 3289 

172 Ala lie Arg Tyr Thr Lys Phe Leu Lys Arg Gin Val Arg lie Leu Gin 

173 160 165 170 175 

175 cct cac tct cag att gga get cct atg get aac ccc tct tac ctt tgt 3337 

176 Pro His Ser Gin lie Gly Ala Pro Met Ala Asn Pro Ser Tyr Leu Cys 

177 180 185 190 

179 tat tac cac aac tec caa ccc tga tgaactacac agaagctege tagctagaca 3391 

180 Tyr Tyr His Asn Ser Gin Pro 

181 195 

183 tttggtgtca tcctctcaac ctttttcatg ttgatatatt atatatagat gcataaagat 3451 
185 tcgatccaag attgtatggg tgttttaata ttattattct aagatatatg atgtacaatt 3511 
187 gtgtaccaag tttctttatc ttgatatcat atgcataaat aattggtgaa taaaaagaag 3571 
189 atattgattg taaacaaaaa aaagaagata ttgattgtta attagggttt gatcattctg 3631 
191 tatgaaagct ttggcctgca aattaatttt cgatatatat atatatatat ggagaatata 3691 
193 tatcaaatac ttttttaatt tgactataat ttgtatcaat tatctgaatc tgatgagtgt 3751 
195 aggttatata tggattagca aaaaagaaaa caaccattat tacgcaccta cattaaaaat 3811 
197 catccaccaa agaagaaacc atcctcaaga gggttccctc tagag 3856 

200 <210> SEQ ID NO: 2 

201 <211> LENGTH: 198 K ' 

202 <212> TYPE: PRT « 0, | i gJhJ> 

203 <213> ORGANISM: Arabidopsis thaliana ^ ftflX^ ^ c ^" 

204 <223> OTHER INFORMATION: INDEHISCENT1 (IND1) protein^ 
206 <400> SEQUENCE: 2 
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228 165 170 175 

229 His Ser Gin lie Gly Ala Pro Met Ala Asn Pro Ser Tyr Leu Cys Tyr 

230 180 185 190 

231 Tyr His Asn Ser Gin Pro 

232 195 

235 <210> SEQ ID NO: 3 

236 <211> LENGTH: 2765 

237 <212> TYPE: DNA 

238 <213> ORGANISM: Arabidopsis thaliana 

240 <220> FEATURE: ^ 

241 <223> OTHER INFORMATION: 5' untranslated region IND1 promoter 

243 <400> SEQUENCE: 3 

244 ctctagacca tctactatcc ggttgttgac ccttaaagct tttgaagact actagaataa 60 

245 tgcaaatacc atatgtccat atccatcctt ttcttttgtt tgaactgaac attctaattt 120 

246 tgtaaaagaa aaaaccttat gttaatatca ccgtaggcaa aaaaaatatc tcatcatatt 180 

247 aaatttttat tataagatta tacattctct cgttgtaaga gttactccaa ttgcaagtgt 240 

248 tgtattaact aataaaaagg acgaaagtag gaagcttata attaattgat gttgcatagt 300 

249 actggtatat tgttgatgaa tataacaagt atgaacatta atgcatgaaa cggggtattt 360 

250 tgtcttgaac tcattaaagg caatgtgaaa agaagatgtg aggtctcatt ttgaaaattt 420 

251 atcttctagc tttgtcgatt ttaaatctat gaaatgaacg caacatatag aaatttcatg 480 

252 tggacaacga catttagacg gtatcttaat tagaccgatt aattagtaat atacttatat 540 

253 atataattag tggtgattat aagtttactt atccacttga gaatttaaac aatgggcaat 600 

254 accttaatgt cgaaagaagc cgtccccact tcgtgtaatg agttatgggg gagagatcct 660 

255 gttaaatcgt caaataaaac aacttaagaa ctagaaattg acaccaaaaa tcataaagag 720 

256 aacgttgaag aagtcattta tcgtatccag ctcatatttc ctagctaaga tcaaatcaag 780 

257 gccgttgaaa gggcttgtaa gaaaatgtcg aagaaaccgt ggggtttaga agaaagacaa 840 

258 gaaatagaag aacaatgatg ttaaattgcc tattttggtg tataggagtt gtcaaaagag 900 

259 gagagagaga agaaaattag gtcaaaataa tgagcactaa aaatggagac atgtgttgag 960 

260 taactattac aagagcgact tatgcttcct tatggcaatg atatccaaac caaagtgcaa 1020 

261 cgctcctttt ttgccctaat ttcgtaaagt ctctctcctt cttcgtcctt aggaaaaacc 1080 

262 ctagaaattt aatcccttgt tcttgatctt gctttttgag taaccatgat tttgaccaca 1140 

263 cactatttct tctatctttt gtggtctata ggattttgct ttatatgtgt ttcttgtatt 1200 

264 gctccgtacg tacgtatacg aatttaaatg gttataacaa ggtttatata aactagcaca 1260 

265 aatgagtcca tgaaatttgt tagcgaaaaa ggtagaaata tattgagtct ttaaacggca 1320 

266 atatatataa ttttgctgca aaacttagct ttaatcatga tctaatgata ttttctttaa 1380 

267 tttcctttgc caaattaatc acatgcacgg atttttggca agttatgtgt cgaattcttc 1440 

268 cattcacaca acactaaact taattagaac tctaggaaat attttaaaat gacaacttta 1500 

269 tcgaaaaaaa tttagttatg aaaacaattc cagaattaaa catgagctat ataatttaag 1560 

270 ataaaatgaa gtaatattga tatgtatgta ataacatatc tgattgcggt aaaaaaaaac 1620 

271 atatctgatt aaattgttca tgcaggccca tgtcactatg atgtcatcac gtttttattt 1680 

272 tcacaataac taatatatat tcaaaaaaat agttttgtca gattaaattt tttttggtgg 1740 

273 tcagctttct ccaacctact aaactagttt ggaatgttct cttctttatt tttctttttc 1800 

274 ttgatttctt atgtttttta tttatggaat tttaagacgg attgtttagg tcgtttctct 1860 

275 cttttcttgt tttctaaagt tacttttgta aactcatctc ctcccaatta gacagtcaat 1920 

276 catatagtta tcttttaata tatgtctagt tgataaaaaa aatgaaaaaa tactggtggt 1980 

277 agttctacta atgtttgtgt aaaaaatctg atattatgaa tctaatcaat ttctttgatc 2040 

278 gtataatgtg ggttaaattt agtaattttt tacataaata agaactgtaa tgttgatgta 2100 

279 tattggggaa tcagtatatt agcttgggta actatacttc tggaaatact tgaagattta 2160 

280 actatttgca aaattataat ttagtcccga aaaatacaga cgacgggaca cgacaacata 2220 
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291 t.^.^tt J-Jttttjt ™.CC tatataaat, jt-tjttjt 

282 ggttgggttc aaaagccttt tcaaaagttc a ggttttttt tttgattaaa 2400 

283 ggaaataagt tgagtcataa aaaataatgg "attttgta ttaaaaccat 2460 

284 acgcacatat taagaagtta Jttttttttc actaccaaat «^ attaaataca 2520 

285 gcaaccattc ataaaacaat ^attaaag ^ata aa a g ga aaaggt 2580 

286 cttaaaattt acatataaat ttacaaaaca y tatgaagtaa 2640 

287 aaaatatatc ataaaatgag acatatatcc tataaaaaaa aaatg gg tafcatatcag 2700 

288 ataataagag acatgcatgt aagcattcgg ^aattaatc J aagaaaag at 2760 

289 taaatacata tgtgtatatt tctggaaaaa gaatatatat attgag ^ 

290 gaaaa 

293 <210> SEQ ID NO: 4 

294 <211> LENGTH: 496 

295 <212> TYPE: DNA , 

296 <213> ORGANISM: Arabidopsis thaliana 

HI lilt ™»1«IO»: 3' untranslated region » P^oter 

301 <400> SEQUENCE : 4 , fttno totc atcctctcaa cctttttcat 60 

302 atgaactaca cagaagctcg ctagctagac atttggtgtc £cctt ttaat 120 

303 gttgatatat tatatataga tgcataaaga ttcgatccaa ^ttgt^gg gg & ^ 

304 attattattc taagatatat gatgtacaat tgtgtaccaa 9 aa aagaagat 240 

305 tatgcataaa taattggtga ataaaaagaa ^tattgatt £aaac Mattliattt 30 0 

306 attgattgtt aattagggtt tgatcattct ^atgaaagc tt gactataa 360 

307 tcgatatata tatatatata tggagaatat Jtatcaaata ^ttttt J 420 

308 tttgtatcaa ttatctgaat ctgatgagtg taggttatat atgg g catcctcaag 4 80 

309 acaaccatta ttacgcacct acattaaaaa tcatccacca aagaag ^ 

310 agggttccct ctagag 

313 <210> SEQ ID NO: 5 

314 <211> LENGTH: 5622 

315 <212> TYPE: DNA , 

316 <213> ORGANISM: Arabidopsis tnaliana 

III lilt ^INFORMATION: SHATT ERPROOF 1 (SHPl, genomic 

321 <220> FEATURE: 

322 <221> NAME/KEY: modif ied_base 

323 <222> LOCATION: (935). .(941) 

324 <223> OTHER INFORMATION : n = any nucleotide 

326 <400> SEQUENCE: 5 afI aaatttta acaatgccag 60 

327 agatctgcaa cagtgaaaag agaaaaoaaa "tggacttga a ^tttg a g 9 ^ 

328 agataatgct tattccctaa tatgttgcca jccaagtgtc aaattggctt 18Q 

329 gatttctgta tcagtggtca tatttgtgga tccaacgtat tcatcatcaa g ^ ^ 

330 ttgctttcag tgcaattcta attcacacgt "aactttaa jatgcatg aa 3Q0 

331 cttottcact aagacacaat acggcaaacc "tcagatta aatttacacg 3 60 

332 ataattaacc tcataatcaa gattcaatgt "ctaaatat ^atgg accac 420 

333 gaagattaga tacgtatatt agta^ttta £ctttcgtt tgtg g^ 480 

334 ctcatagata gtaatatcat tgtcaaattc ctctcgg * taactttt tttttaatta 540 

335 tttaagccta aaagtagtgt attcgcatat g-ttatcgt cctaactt tta 6Q0 

336 acaaaaaaat cgaaaagaaa Jtaatctgtt aaatattttt t 9 tgag 660 

337 gtttctattt aaaaaatgct tgaaatttga cagttatg tcactcgagg 720 

338 cgatgtctag atactcagaa tttaatcaag atgtcttatc aaatttgttg g 

w«r Ins been detected in the Sequence Listing. 
Usrof n and/or \JJ ™? insuve a corresponding 

Review the Sequence Usin g to g ^ ± ^ of 
explanation is presented in the <220> to 
' each sequence using n or Xaa. 
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:342 M:341 W: (46) n n n or "Xaa" used, for SEQ ID# : 5 
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